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CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application is a Divisional of United States Patent Application No. 

09/415,148, incorporated by reference herein. 

REFERENCE TO COMPUTER PROGRAM LISTING/TABLE APPENDIX 
10 [0002] In accordance with 37 C.F.R. §1.96, this patent contains a computer software 

listing in a microfiche appendix. The listing includes 1 microfiche having 44 frames. 

COPYRIGHT NOTICE 

[0003] A portion of the disclosure of this patent document contains material which is 

15 subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent docimient or the patent disclosure as it appears in the Patent 
and Trademark Office patent file or records, but otherwise reserves all rights under the Copyright 
Law. 

FIELD OF THE INVENTION 
20 [0004] The invention relates generally to patent data mining and analysis tools, and, more 

specifically to a computer based method and apparatus for mining and displaying patent data. 

BACKGROUND OF THE INVENTION 
[0005] Intellectual property assets typically make up 60 - 80% of the value of most 

companies. Even though revenues from U.S. patents rose from $3 billion in 1960 to $60 billion 
25 in 1993, according to the United States Patent and Trademark Office, most traditional companies 

1 



still don't effectively manage their intellectual property assets, let alone track and analyze the 
intellectual property assets of their competitors. 

[0006] Patents usually make up the largest portion of the IP portfolio in any technological 

company. Companies have always used patents to prevent others from making, using or selling 

5 patented products or methods, or to force companies into licensing agreements. There are many 
good reasons for tracking, mining and analyzing intellectual property data, and particularly patent 
data. The profitability and growth of some companies is directly related to their ability to 
develop, defend, and commercialize key patents. Companies who are patent savvy can make 
more informed decisions about entering new technological areas. They can determine whether 

10 certain products warrant patent protection or would infringe the patents of others. They can 
better predict where their industry, and competitors, is headed. Analyzing patent data is essential 
in effective negotiation of licenses. It is crucial in determining the true value of a merger or 
acquisition candidate. It is usefiil in finding infiingers and in identifying licensable technologies. 
It is useful in finding prior art to invalidate the patents of another. Tracking patents, mining and 

15 analyzing patent data in core technologies and businesses, then, is clearly a key strategic priority 
for many companies. Yet, with over 10 million worldwide patents existing at present (nearly 6 
million of which have issued in the United States), and with well over 10,000 new applications 
for patents filed in an average week, it is exceedingly difficult to track, mine and analyze the 
enormous volume of available patent data. 

20 [0007] Recent software advances have addressed the problem. While private sector 

companies have offered patent database searching software for years, recently the United States 
Patent and Trademark Office offered full text searching of patents (and display of images) 



through its web site at www.uspto.gov . IBM also offers free patent searching through its web 
site at www.patents.womplex.ibm.com. Most commercially and publicly available software 
search engines use relatively rudimentary search logic (e.g., Boolean operators, etc.) In 1994, the 
first natural language patent search engine, MAPIT, was introduced by Manning & Napier 

5 Information Services, Inc. of Rochester, New York. This powerful product enables one to search 
the USPTO patent database back to 1971 usmg natural language phrases. With MAPIT, there is 
no need to learn or worry about the limitations inherent in Boolean logic. Best of all, the 
software intelligently ranks the patents found in the search according to relevance, and offers 
flexibility for the user to modify the contours of the search. 

10 [0008] Patent searching for patentability, validity, or infringement purposes are but a few 

types of data mining processes. There are many others. The goal is to first locate the relevant 
patents, and then to obtain as much tangible quantitative and qualitative information as possible 
about them. 

[0009] Quantitative measurements include data such as the number of patents held by 

15 companies in selected technology areas. They also include patenting trends extrapolated from 
the mined data. Quantitative measurements can include counting the number of claims in patents 
as well. 

[00101 Others are beginning to recognize and appreciate the need for mining and 

analyzing patent data. One such company is Aurigen, Inc. of Mountain View, Califomia, 
20 (formerly known as SmartPatents, Inc.). This company is the assignee of several patents related 
to data mining of patents, including U.S. Patent No. 5,799,325 (Rivette et al.) related to a system, 
method and computer program product for extracting, synchronizing, displaying, navigating and 



manipulating text and image documents simultaneously in electronic forai; U.S. Patent No. 
5,806,079 (Rivette et al.) related to a system, method and computer program product for using 
intelligent notes to organize, link, and manipulate disparate data objects; U.S. Patent No. 
5,809,318 (Rivette et al.) related to a method and apparatus for synchronization, displaying and 
5 manipulating text and image documents; and U.S. Patent No. 5,845,301 (Rivette et al.) related to 
a system, method and computer program product for displaying and processing notes containing 
note segments linked to portions of documents. 

[0011] Despite advances in quantitative and qualitative methods of patent data mining, 

much remains to be done, especially with respect to qualitative analysis of patents. One 
10 qualitative measurement relates to the "strength" of a patent, which is dependent upon a number 
of criteria. 

[0012] One such criterion relates to the claims of the patent. It is well known that the 

claims define the metes and bounds of an invention. Claims vary in scope, and experienced 
patent attorneys are routinely asked to read and evaluate the scope of patent claims. Claim scope 
15 is a quasi-subjective interpretation, but there are objective measurement criteria as well. For 
example, in general, the fewer elements in a claim - the broader its scope. The number of words 
in a claim can also be an indication of scope. 

[0013] Another qualitative criterion relative to patent claims relates to the type of 

invention being claimed. Patent attorneys and agents can readily discern the category of statutory 
20 subject matter of a patent claim pursuant to 35 U.S.C. §101 (e.g., article of manufacture, 
machine, composition of matter, process, and improvement). Although these interpretations are 
intuitive to most experienced patent attomeys, heretofore, no computer software product has 



been developed to perform this type of qualitative data mining and analysis, and determine the 
category of statutory subject matter to which a patent relates. 

[0014] Claim structure and scope are but two criteria that determine the "strength" of a 

patent. Another factor might be how many citations a patent contains to other patents or other 
5 patent references. As is well known, each filed patent application is subjected to a prior art 
search by the Examiner in the Patent Office. In addition, applicants and their attorneys have a 
duty to disclose to the Patent Office all information known to that individual to be material to 
patentability of the invention. These prior art references, identified both by the Patent Office and 
by the applicant and her attomey, and which comprise both patent and non-patent publications, 

10 appear on the face of the patent, and sometimes appear in the Backgroxmd of the Invention 
section of the patent. The number of patents cited might be an indication of the strength of the 
patent. For example, a "pioneer patent" directed to a revolutionary invention would typically cite 
few if any patents or publications. On the contrary, an improvement patent in a crowded art area 
is likely to cite a large number of other patents and publications, which might indicate a weaker 

15 patent. 

[0015] Another indication of patent strength is the number of patents that cite the patent 

in question. For example, a pioneer patent is likely to be cited more often than an improvement 
patent. What is needed, then, is a computer based method and apparatus for mining and 
displaying patent data, which method and apparatus performs both quantitative and qualitative 
20 analysis, interpretation and display. Ideally what is needed is a computer based method and 
apparatus for performing this analysis on a plurality of patents, with the ability to then rank the 
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patents according to a number of different criterion. What is also needed is a computer based 
method and apparatus for ranking one group of patent-related documents against another group. 

SUMMARY OF THE INVENTION 
[0016] The invention broadly comprises a computer based method for analyzing and 

5 ranking a set of patents according to strength, comprising analyzing the set of patents by 
consideration of objective parameter(s) of each patent in the set, the parameter(s) selected from 
the group consisting of the number of claims within each patent being ranked, the number of 
independent claims within each patent being ranked, the number of citations to prior publications 
cited by a patent examiner within each patent being ranked, the number of other patents which 

10 contain a citation to a patent being ranked, the number of elements in an independent claim of 
each patent, the number of elements in an exemplary claim of each said patent, and the number 
of terms found in both independent and exemplary claims. 

[0017J It is a general object of the present invention to provide a method and apparatus 

for analyzing a pool of patents and for determining and displaying a number of objective facts 
15 about each patent in the pool, including but not limited to how many independent claims are 
contained within each patent. 

[0018] It is another object of the present invention to provide a method and apparatus for 

analyzing a pool of patents and then ranking the patents in the pool according to strength as 
determined by either a predetermined formula or a user-modifiable formula. 
20 [0019] It is still another object of the present invention to provide a method and apparatus 

for analyzing a pool or group of patents and then ranking them against another group of patents. 
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[0020] These and other objects and advantages of the present invention will be readily 

appreciable from the following description of preferred embodiments of the invention and from 
the accompanying drav^dngs and claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 is a screen capture illustrating a listing of patents mined using the present invention and 
displayed in ascending chronological order; 

Figure 2 is a screen capture illustrating a listing of patents mined using the present 
20 invention and displayed in descending chronological order; 

Figure 3 is a screen capture illustrating a listing of patents mined using the present 
10 invention and displayed in alphabetical order by assignee; 

Figure 4 is a screen capture illustrating a listing of patents mined using the present 
invention and displayed in alphabetical order by inventor; 

Figure 5 is a screen capture illustrating a listing of patents mined using the present 
invention and displayed in ascending order by U.S. Class; 
15 Figure 6 is a screen capture illustrating a listing of patents mined using the present 

invention and displayed in order by International Class; 

Figure 7 is a screen capture that illustrates how a user would select a category to 
initiate a new search using the present invention; 

Figure 8 is a screen capture that illustrates how a user would initiate a new search 
20 for patents in the "pool" having the keywords "instrument panel" in the title; 

Figure 9 is a screen capture that illustrates the results of a search for patents in the 
pool containing the keywords "instrument panel" as described with reference to Figure 8; 
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Figure 10 is a screen capture of the fiill text of U.S. Patent No. 5,487,558 obtained 
by clicking on the hyperlink labeled "Full Patent" as shown in the screen capture of Figure 9; 

Figure 1 1 is a screen capture of the patents cited in U.S. Patent No. 5,487,5 5 8 
obtained by clicking on the hyperlink labeled "Cited Patents" as shown in the screen capture of 
Figure 9; 

Figure 12 is a screen capture of the patents that cite U.S. Patent No. 5,487,558 
obtained by clicking on the hyperlink labeled "Citing Patents" as shown in the screen capture of 
Figure 9; 

Figure 13 is a screen capture of the Abstract of U.S. Patent No. 5,487,558 
obtained by clicking on the hyperlink labeled "Abstract" as shown in the screen capture of Figure 

9; 

Figure 14 is a screen capture of the Claims of U.S. Patent No. 5,487,558 obtained 
by clicking on the hyperlink labeled "Claims" as shown in the screen capture of Figure 9; 

Figure 15 is a screen capture of part of the image of the first page of U.S. Patent 
No. 5,487,558 obtained by clicking on the hyperlink labeled "Image" as shown in the screen 
capture of Figure 9; 

Figure 16 is a screen capture showing a table of patents in a pool, displayed in a 
"Short" 5 form; 

Figure 17 is a screen capture illustrating the "Filter" function of the invention; 
Figure 18 is a screen capture displaying patents obtained using a U.S. Class filter 
selection from the choices displayed in Figure 17; 
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Figure 19 is a screen capture that shows a listing of "classy cousins" of a "focus 
patent" mined and displayed by the present invention, where a "classy cousin" is defined to be a 
patent that shares the U.S. Class and at least one reference with the "focus patent"; 

Figure 20 is a screen capture that shows both a predetermined "Mapit Strength 
5 Formula" and a "User-modifiable Formula" used to rank the mined patents in the pool and also 
shows a representative ranking of mined patents in descending order according to the Mapit 
Strength Formula; and, 

Figure 21 is a screen capture similar to that shown in Figure 20 but arranged to 
show the representative ranking of mined patents in descending order according to the "User- 
1 0 modifiable Formula" ; 

Figure 22 is a screen capture similar to that shown in Figure 20 but arranged to 
show the representative ranking of mined patents in descending order according to total number 
of claims in each ranked patent; 

Figure 23 is a screen capture similar to that shown in Figure 20 but arranged to show the 
15 representative ranking of mined patents in descending order according to total number of 
independent claims in each ranked patent; 

Figure 24 is a screen capture similar to that shown in Figure 20 but arranged to 
show the representative ranking of mined patents in descending order according to total number 
of citations in other patents in the pool to each ranked patent; 
20 Figure 25 is a screen capture similar to that shown in Figure 20 but arranged to 

show the representative ranking of mined patents in ascending order according to the total 
number of search terms found in the first claim of each ranked patent; 
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Figure 26 is a screen capture similar to that shown in Figure 20 but arranged to 
show the representative ranking of mined patents in descending order according to total number 
of search terms found in the exemplary claim of each ranked patent; 

Figure 27 is a screen capture similar to that shown in Figure 20 but arranged to 
5 show the representative ranking of mined patents in descending order according to total number 
of elements in the first claim of each ranked patent; 

Figure 28 is a screen capture similar to that shown in Figure 20 but arranged to 
show the representative ranking of mined patents in descending order according to total number 
of elements in the exemplary claim of each ranked patent; 
10 Figure 29 is a screen capture illustrating a list of patents mined using the present 

invention, and also showing a drop-down box whereby a user can select an American or 
European "short-form" date representation or a long-form date representation; 

Figure 30 is a screen capture illustrating a list of patents arranged in ascending 
(earliest to latest) chronological order by issue date, where both the application date and issue 
1 5 date are displayed in a long-form format; 

Figure 31 is a screen capture illustrating the "My List" function of the present 
invention, where a user may place selected mined patents in a special folder labeled "My List" for 
fiiture reference and analysis; 

Figure 32 depicts a representative client personal computer in communication 
20 with a network and operatively arranged to implement the computer program of the present 
invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
[0021] At the outset, it should be appreciated that the present invention comprises a 

computer-based method for mining patent data, although the concepts of the invention could 
obviously be used for mining non-patent data as well. Moreover, the present invention can mine 

5 and analyze patent data from any suitable "pool" of patents. The pool may comprise one or more 
patents. They may be domestic (i.e., United States) or foreign patents. The pool may include 
patents that have been sorted or filtered (e.g., as by assignee, inventor, etc.) or may be unsorted. 
Although the present invention is particularly well suited for mining and analyzing patent data 
associated with a patent portfolio of a particular company, or assignee, the invention is not so 

10 limited. For example, the invention could be used to mine and analyze industry-specific patents; 
or patents in the name of a particular inventor. It could also be used to analyze the portfolio of a 
patent holding company. A particular advantage of the present invention is its ability to rank a 
set of patents according to strength, both against one another within the set, and also as a group 
against another group of patents. In the description of the preferred embodiment, we describe 

15 use of the invention in mining and analyzing patent data associated with a particular assignee, but 
the invention is not so limited. 

[0022] Although the preferred embodiment described herein illustrates use of the 

invention with a pool of United States Patents, obviously the technology of the invention could 
be used to mine data from foreign patents, utility models, inventor's certificates, industrial 
20 designs, design patents, patent applications, or any other comparable form of legal protection for 
intellectual property. In fact, the technology could be used to mine data from non-patent 
documents. In the description which follows, the terms "patent" and "patent-related document" 



are intended to include all of the above-recited types of documents related to the legal protection 
of intellectual property. 

[0023] In the drawings and written description of the invention, we utilize screen captures 

taken while operating the software to illustrate the best mode of the invention known to the 
5 inventors at the time of application for patent, and to enable those having ordinary skill in the art 
to use the invention. We also include a microfiche appendix containing the source code for the 
computer program of the invention to enable one having ordinary skill in the art to make the 
invention. The software of the present invention is operatively arranged to operate with a 
conventional web browser, such as those conmiercially available from Netscape or Microsoft 
10 Corporation. 

[0024] Adverting now to the drawings, we begin our description with an assumption that 

a patent search has already been conducted to identify a pool of patents for study, mining, and 
analysis. The pool of patents can be assembled using any number of tools. For example, one can 
use the commercially available MAPIT™ search tool to identify a group of patents for study, or 
15 any other suitable tool (such as the USPTO web site, for example). In the drawing figures, a 
patent search was completed to identify certain patents assigned to a common assignee 
(Mercedes-Benz). The raw electronic patent data used to represent the pool is available directly 
from the USPTO or from third party vendors. 

[0025] In the description that follows, the term "Exemplary Claim" is defined to be a 

20 claim in a United States patent determined by the Patent Office to be exemplary of the invention. 
In practice, this "exemplary claim" is determined by the Examiner who examined the application 
which matured in the patent. The exemplary claim, which is also published in the Official 



Gazette, is determined in accordance with Section 1302.09 of the Manual of Patent Examining 
Procedure as follows: 

Examiners, when preparing an application for issue, are to 
record the number of the claim selected for printing in the Official 
5 Gazette in the box labeled "PRINT CLAIM" on the face of the file 

wrapper. 

The claim or claims should be selected in accordance with 
the following instructions: 

(A) The broadest claim should be selected. 
10 (B) Examiners should ordinarily designate but one claim on 

each invention, although when a plurality of inventions are claimed 
in an application, additional claims up to a maximum of five may 
be designated for publication. 

(C) A dependent claim should not be selected unless the 
15 independent claim on which it depends is also printed. In the case 

where a multiple dependent claim is selected, the entire chain of 
claims for one embodiment should be listed. 

(D) In reissue applications, the broadest claim with changes 
or the broadest additional reissue claim should be selected for 

20 printing. 

M.P.E.P. §1302.9 (Notations on File Wrapper) July 1998 

[0026] Once a pool of patents has been assembled, the invention is capable of sorting the 

patents in a number of different ways. As shown in the screen capture of Figure 1 (to the right of 

the word "Sort"), the program can sort in ascending chronological order (labeled "Oldest"), 

25 descending chronological order (labeled "Newest"), by Assignee, Inventor, U.S. Class, or 

International Class. 

[0027] In Figure 1, the patents are displayed in ascending chronological order (earliest to 

latest). This is accomplished by clicking on hyperlink 10. The screen capture of Figure 1 shows 
complete data for only four patents, U.S. Patent Nos. 3,989,193 (the patent number is hidden by 
30 reference boxes 11, 12 and 13); 3,989,192; 4,043,146 and 4,050,239. The ascending 
chronological sort tool 10 of the invention (labeled "Oldest" to the fight of the word "Sort" on the 
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screen) lists the patents in chronological order according to issue date. In the listing shown in the 
screen capture of Figure 1, the American date format (mm/dd/yyyy) is used, although the date 
format is settable by the user. Further chronological sorting of the list shown in Figure 1 can be 
done using individual year tools 11, 12, 13, etc. For example, clicking on tool 12 would result in 

5 a list of patents that issued in 1972. 

[0028] The program of the invention is capable of mining and displaying extensive 

information about each individual patent in the pool, as well as performing a ranking of a group 
of patents (discussed infra). For example, in Figure 1, four patents are identified. For 
convenience, we discuss only U.S. Patent No. 3,989,192 (the patent listed second from the top). 

10 As seen in the drawing, the computer program of the invention has analyzed the patent, and 
displayed various types of information about the patent. Specifically, the program has 
determined the title of the patent to be DEVICE FOR VARYING THE GAS EXIT AREA OF 
AN EXHAUST NOZZLE FOR A JET DEFLECTING DEVICE. Two inventors have been 
identified: Heinrich Enderle and Alfred Jabs. The program also determines the application date 

15 (02/28/1975) and issue date (11/02/1976). The U.S. and International Classes identified on the 
face of the patent are identified and displayed. The program counts the total number of claims 
(1 1) in the patent, and determines how many of those are independent claims (4). The program 
then determines how many patents are cited by this patent ("Cited Patents: 6"), and how many 
other patents cite the patent ("Citing Patents: 2"). The "Cited Patents" are determined in 

20 accordance with Section 1302.12 (Listing of References) in the Manual of Patent Examining 
Procedure. 
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[0029] The next line lists the number of search terms found in the first claim of the patent 

(33), in the exemplary claim of the patent (33), the number of elements in the first claim (1) and 
the number of elements in the exemplary claim (1). 

[0030] Figure 2 is a screen capture similar to that of Figure 1, except showing a listing 

5 that results from sorting the patents in descending chronological order by clicking on hyperlink 
20. 

[0031] Figure 3 is a screen capture that shows a listing that results fi-om an assignee sort, 

achieved by clicking on hyperlink 30. In the screen capture shown, all patents listed include 

Mercedes-Benz as one of the assignees. 
10 [0032] Figure 4 illustrates the ability of the program to sort patents in the pool by 

inventor name. The patents are listed in alphabetical order by first listed inventor last name. 

This sort is achieved by clicking on hyperlink 40. In the screen capture of Figure 4, the first 

named inventor of the four patents shown has a last name beginning with the letter "A." 

[0033] Figure 5 is a screen capture that shows a listing that results from an ascending 

15 sorting by U.S. Class as noted on the face of each patent. This sort is achieved by clicking on 

hyperlink 50. 

[0034] Similar to the screen capture of Figure 5, the screen capture of Figure 6 shows a 

listing that results fi-om a sorting by International Class as noted on the face of each patent. This 
sort is achieved by clicking on hyperlink 60. 
20 [0035] Once a pool of patents has been identified, a user can search the pool for specific 

patents of interest. As shown in Figure 7, drop down box 70 recites a number of patent segments 
that can be searched (PatentNumber, Assignee, Inventor, etc.) The search keywords are entered 



into box 71, once a patent segment has been selected. In Figure 8 for example, the keywords 
"instrument panel" have been entered into search box 71, and the program has been configured to 
find all patents in the pool that contains those keywords in the title. 

[0036] The results of the search for all patents in the pool which contain the keywords 

5 "instrument panel" in the title is illustrated in Figure 9, where four patents of the pool are shown. 
The first patent in the list is U.S. Patent No. 5,487,558 for an invention entitled, 
"INSTRUMENT PANEL IN A MOTOR VEHICLE." The pertinent information about this 
patent is shown in a "long form," although the results of the search could be displayed in a short 
tabular form as described infra. Further information about the patent can be obtained by clicking 
10 on hyperlinks 72-77 as shown in Figure 9. For example, one can view the full text of the patent 
by clicking on hyperlink 72 (Full Patent) : a list of the patents cited by the patent by clicking on 
hyperlink 73 ( Cited Patents ): a list of the patents that cite the patent by clicking on hyperlink 74 
(Citing Patents) : the abstract of the patent by clicking on hyperlink 75 (Abstract) : the claims of 
the patent by clicking on hyperlink 76 (Claims); or the image of the actual patent (including the 
15 drawing figures) by clicking on hyperlink 77 (Image) . 

[0037] Figure 10 is a screen capture of the full text of United States Patent No. 

5,487,558, obtained by clicking on hyperlink 72 in Figure 9. 

[0038] Figure 1 1 is a screen capture of the references cited on the face of United States 

Patent No. 5,487,558, obtained by clicking on hyperlink 73 in Figure 9. 
20 [0039] Figure 12 is a screen capture containing a listing of the patents which cited United 

States Patent No. 5,487,558, obtained by clicking on hyperlink 74 in Figure 9. 
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[0040] Figure 13 is a screen capture of the Abstract of United States Patent No. 

5,487,558, obtained by clicking on hyperlink 75 in Figure 9. 

[0041] Figure 14 is a screen capture of the Claims of United States Patent No. 5,487,558, 

obtained by clicking on hyperlink 76 in Figure 9. 
5 [0042] Figure 15 is a screen capture of the image of United States Patent No. 5,487,558, 

obtained by clicking on hyperlink 77 in Figure 9. 

[0043] The patents obtained as a result of a data mining operation may be displayed in 

various formats using the software of the present invention. Figure 16, for example, displays a 
"Short Form" listing, which includes tabular columns entitled, "Patent Number," "Assignee," 

10 "Issue Date," "Inventor," "U.S. Class," and "Title." Obviously, other coliunn headings are 
possible, and these headings as well as the number of columns shown are settable by the user. 
[0044] The type of format is selected by clicking on icon 62 as shown in Figure 16. For 

example, while viewing the "Short Form" listing shown in Figure 16, one could click icon 62 to 
view a "Long Form" listing. 

15 [0045] Once a pool of patents has been identified, the program is capable of filtering the 

pool to focus upon a subset of patents. This is done by clicking on the Filter hyperlink 63 on the 
main screen. In Figure 17, for example, the Filter hyperlink has been selected, and the user can 
filter the pool of patents either by U.S. or Intemational Class, or by Assignee. To select a subset 
of patents that are all classified in the same class, one would simply click on an appropriate class 

20 hyperlink from the list of classes. If one selected U.S. Class 001, for example, a list of patents 
would be displayed as shown in Figure 18, where U. S. Patent No. 5,743, 5 53 is displayed, and 
this patent is classified in U.S. Class 001. 



[0046] Another data mining tool of the present invention is entitled Classv Cousins, 

which, again, is selected by clicking on the Classv Cousins h yperlink 64 on the main screen (as 
shown in Figure 19). A "Classy Cousin" is defined to be a patent that shares the same U.S. Class 
and at least one reference cited therein with the focus patent. In other words, if U.S. Patent No. 
5 5,896,942 is the focus patent, and a user selects the Classy Cousins sort tool, then only those 
patents in the pool in U.S. Class 180 that cite at least one reference cited on the face of the *942 
patent will be selected. 

[0047] Finally, hyperlinks (identified by underlined text in the screen capture) enable a 

user to view the full text of the focus patent (Full Patent) ; a list of the patents cited by the focus 
10 patent (Cited Patents) : a list of the patents that cite the focus patent (Citing Patents) : the abstract 
of the focus patent (Abstract) ; the claims of the focus patent (Claims) ; or the digital image of the 
focus patent (Image). 

[0048] Once a pool of patents has been identified, perhaps the most usefiil tool of the 

present invention is the ranking tool. By clicking on the Strength icon on the main screen (icon 

15 65 shown in Figure 6) the program can perform an analysis of the patent in the pool and rank the 
patents according to strength. The metric used to determine strength comprises a weighted 
formula that includes several factors, including the number of independent claims, the total 
number of claims, the number of elements in the first claim, the number of elements in the 
exemplary claim, the number of references cited in the patent, and the number of patents that cite 

20 the focus patent. The weighted formula is reprinted herebelow: 
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Strength Rating = A (B x Number of Claims) + (C x Number Of Independent 
Claims) + (D x Number of Citations) (E x Number of First Claim Terms) (F x Number of 
Exemplary Claim Terms) (Gx Number of First Claim Elements) (Hx Number of Exemplary 
Claim Elements) 
5 where, 

Number of Claims = total number of claims in patent being evaluated 
Number Of Independent Claims = total number of independent claims in patent 
being evaluated 

Number of Citations = number of patents that cite the patent being evaluated 
10 Number of First Claim Terms = number of terms found in first claim of patent 

being evaluated 

Number of Exemplary Claim Terms = number of terms found in exemplary claim 
of patent being evaluated 

Number of First Claim Elements = number of elements in first claim of patent 
1 5 being evaluated 

Number of Exemplary Claim Elements = number of elements in exemplary claim 
of patent being evaluated 

[0049] Obviously, the "strength" of a patent is a subjective determination. Some 

attorneys and business people might be of the opinion that a patent with more claims is stronger 
20 than a patent with fewer claims. Others may believe that it is the number of elements in the 
broadest claims which determines strength. On the other hand, a pioneer patent which is cited 
often by subsequent patents may be "stronger" than a patent with only a few elements in a weak 
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or old technology area. In view of this subjective assessment, the formula of the present 
invention is preset with various weighted factors, but these weights may be adjusted by the user. 
The user can set one or more factors to any weight, including negative weights. 
[0050] In a preferred embodiment, the software calculates the strength of a patent based 

5 upon the predetermined formula, where the coefficients are set as follows: 



A= 


1.0 


B = 


4.0 


C = 


8.0 


D = 


-0.1 


E = 


-0.1 


F = 


-0.1 


G= 


-0.1 



[0051] This predetermined formula defines the "Mapit Strength" of the patent, as shown 

by the formula in Figure 20 labeled "Definition of Mapit Strength". Figure 20 also shows a 
15 listing of patents in a pool sorted in descending order of "Mapit Strength." 
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[0052] A user-defined formula is shown in box 90 in Figure 21. In this formula, the user 

has set the formula coefficients as follows: 

A=1.0 

B = G.O 

5 C = 5.0 

D--2.0 
E = -2.0 
F = -4.0 
G = -4.0 

10 [0053] Figure 21 also shows a listing of patents in a pool sorted in descending order of 

"User-Defined Strength." 

[0054] It is also possible to display the ranked patents in order according to any of the 

factors in the formula. For example, Figure 22 lists the patents in descending order according to 
total number of claims. Figure 23 lists the patents in descending order according to number of 

15 independent claims. Figure 24 lists the patents in descending order according to the number of 
citations appearing on the face of the patent. Figure 25 lists the patents in ascending order 
according to the number of search terms in the first claim of the patent. Figure 26 lists the 
patents in ascending order according to the number of search terms in the exemplary claim of the 
patent. Figure 27 lists the patents in ascending order according to the number of elements in the 

20 first claim of the patent. Figure 28 lists the patents in ascending order according to the number of 
elements in the exemplary claim of the patent. 
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[0055] As mentioned previously, the software is compatible with various date formats, 

including American and European formats. As shown in the screen capture of Figure 29, a drop- 
down box allows one to choose between short form mm/dd/yyyy or dd/mm/yyyy format, or a 
long-form format where the name of the month is displayed in fiilL The long form format is 
5 shown in the listing of patents in Figure 30, and the date format box is shown completed as 
"January 31, 1999" in Figure 31. 

[0056] The program determines the total number of claims in a patent under study by 

counting the number of "Claims" rows in a database file. These electronic files are available 
directly from the United States Patent and Trademark Office. There are a number of ways in 

10 which the total number of claims can be determined. For example, one can simply identify the 
claim number at the beginning of each claim (while ignoring the claim number which appears in 
the body of dependent claims), and then count the total number of claims. Since each claim in a 
United States patent must necessarily comprise a single sentence, one can also count periods to 
determine the total number of claims. Additional parsing/counting clues can be obtained by 

15 looking for dependent claim language. 

[0057] To determine whether a claim is independent or dependent in nature, the program 

looks for any conventional phrases used by practitioners to indicate claim dependency. Some of 
these phrases are suggested by the USPTO. Some examples include "Claim N," "Claims N-M, 
and 0-P," "Claims N through M, inclusive," or "any previous claims" where N, M, O, and P are 

20 numbers. The program acconmiodates for capitalization (e.g., "claim" or "Claim") and common 
misspellings (e.g., "Clam" instead of "Claim"). In many cases, it is sufficient to merely search 
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for the word "Claim" in a patent claim - if this word appears, the claim under study is likely 
dependent in nature. 

[0058] There are several different methods used to determine the total number of 

elements in a claim under study. The following are four examples of how the program 
5 determines the number of elements in a claim: 

1 . Look for clauses or phrases, and count these as elements; 

2. Look for ordinal notation ("a, b, c" or " 1, 2, 3 ", or buUeted lists, etc.) and use 
this information to count elements; 

3. Analyze punctuation such as semicolons, commas, parentheses, brackets, 
10 dashes or quotation marks, which may indicate element boundaries. For example, since all 

claims comprise a single sentence, practitioners commonly separate claim elements by semi- 
colons. 

[0059] Another metric relates to a weighted count of linguistic or textual components 

found in the claim(s) under study or of correlations among such components. These components 

15 may be determined via analysis at various linguistic levels, including phonological, 
morphological, lexical, syntactic, semantic, discourse structure, or pragmatic levels. 
[0060] For example, discourse structure of patents can be leveraged in the case of 

"means" language or specification-level definitions leveraged in the claims. The number of 
distinct semantic notions (at a particular level of a semantic hierarchy) could also be counted; 

20 semantic notions at different levels could instead receive different weights. Another example in 
this continuum is to count the number of nominal and verbal phrases or terms and combine this 
with information regarding counts and positions of modifiers such as adjectives, adverbs, and 
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modifying phrases (e.g. prepositional phrases) or terms. Symbolic and formatting information 
can also be leveraged; punctuation, horizontal or vertical spacing, and ordinal information 
representations are some examples. 

[00611 One simple approach is to count the number of "terms" in the claim or claims. A 

5 term, generically, is a string of characters, letters, numbers, symbols, or combinations thereof 
Various levels of normalization may or may not be applied to the terms. Examples of these 
follow: 

1 . The program normalizes case of letters (e.g., uppercase all letters); 

2. The program "stems" the terms, removing some or all suffixes and/or 
10 prefixes and obtains a "root" of the term; 

3. The program removes non-unique occurrences of terms. This can be 
accomplished across the entire claim, or just within certain phrases. This can alternatively be 
applied only in cases where the terms "said" or "the" or other textual clues link the multiple 
occurrences (in patent claims, the words "said" or "the" preceding a noun usually means that the 

15 noun was mentioned previously, i.e., has an antecedent basis); 

4. The program weights occurrences of different terms differently and may 
use a stop-word list to provide a weight of zero to certain terms. 

[0062] Figure 32 illustrates one embodiment of an apparatus operatively arranged to 

implement and run the computer program of the present invention. The apparatus is seen to 
20 include a special purpose computer programmed to perform the program disclosed in the 
microfiche appendix. The computer comprises central processing unit (CPU) 91, random access 
memory (RAM) 92, hard disk storage drive 93, keyboard 94, monitor 95, and network interface 



96. All of these components commiinicate with one another via communication bus 97. The 
computer may be a stand-alone personal computer, or a client or server computer on a network. 
It may also be a mainframe computer. The computer program of the invention, a source code 
listing of which is attached in the microfiche appendix, may be contained on hard drive 93, or on 

5 any suitable storage medium, such as a CD ROM, magnetic tape, optical drive, or floppy disk. 
[0063] Thus, it is seen that the objects of the invention are efficiently obtained, although 

changes and modifications to the invention should be readily apparent to those having ordinary 
skill in the art, without departing from the spirit or scope of the invention as claimed. Although 
the invention is described by reference to a specific preferred embodiment, it is clear that 

10 variations can be made without departing from the scope or spirit of the invention as claimed. 
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